Monitoring, Controlling, And Preventing Traffic Congestion Between Processors

ABSTRACT

A system for monitoring congestion at processors includes queues and a congestion monitor. The queues receive packets, and each queue is associated with a processor. For each queue, the congestion monitor establishes whether a time-averaged occupancy of a queue exceeds a time-averaged occupancy threshold. The congestion monitor provides a notification if the time-averaged occupancy exceeds the time-averaged occupancy threshold.

TECHNICAL FIELD

This invention relates generally to the field of telecommunications and more specifically to monitoring, controlling, and preventing traffic congestion between processors.

BACKGROUND

In distributed architectures, processors may communicate packets to one another through a switch. Some architectures use an N-by-(N-1) switch, which has a separate path from each processor to every other processor. Thus, congestion of one path does not affect other paths. N-by-(N-1) switches, however, are typically expensive. Moreover, processors may be added to a switch over time, creating the need for more paths at the switch.

Other architectures use a non-N-by-(N-1) switch, where processors share paths of the switch. Non-N-by-(N-1) switches, however, may experience head-of-line blocking problems. In this situation, if a processor is congested and cannot accept any more packets, packets for the processor remain in a queue that is shared by other processors. Until the waiting packets are taken from the queue by the processor, packets that arrive after the waiting packets cannot be taken by the other processors. This causes a packet buildup in the shared queue. Accordingly, congestion at the one processor may deteriorate service for other processors that share the same queue.

SUMMARY OF THE DISCLOSURE

In accordance with the present invention, disadvantages and problems associated with previous techniques for monitoring congestion at a number of processors may be reduced or eliminated.

According to one embodiment of the present invention, a system for monitoring congestion at processors includes queues and a congestion monitor. The queues receive packets, and each queue is associated with a processor. For each queue, the congestion monitor establishes whether a time-averaged occupancy of a queue exceeds a time-averaged occupancy threshold. The congestion monitor provides a notification to the processor that is sending the traffic if the time-averaged occupancy exceeds the time-averaged occupancy threshold.

Certain embodiments of the invention may provide one or more technical advantages. A technical advantage of one embodiment may be that a congestion monitor may monitor congestion at a number of processors, and may provide a notification if a processor becomes congested. In the embodiment, the congestion monitor may monitor congestion by establishing a time-averaged occupancy for queues buffering packets for the processors. If the time-averaged occupancy exceeds a time-averaged occupancy threshold, the congestion monitor may provide the notification.

Another technical advantage of one embodiment may be that the time-averaged occupancy may take into account whether the number of packets in a queue exceeds a predetermined threshold. In the embodiment, the congestion monitor determines the frequency of excess packets events at a queue, where an excess packets event occurs when the number of packets exceeds a number of packets threshold. The congestion monitor then determines whether the frequency of excess packets events exceeds a frequency threshold.

Another technical advantage of one embodiment may be that the congestion monitor may be provided at a programmable device separate from the switch. Accordingly, the switch need not be redesigned and/or upgraded. Another technical advantage of one embodiment may be that the different threshold values may be adjusted to achieve a desired amount of congestion control and/or congestion prevention.

Certain embodiments of the invention may include none, some, or all of the above technical advantages. One or more other technical advantages may be readily apparent to one skilled in the art from the figures, descriptions, and claims included herein.

BRIEF DESCRIPTION OF THE DRAWINGS

For a more complete understanding of the present invention and its features and advantages, reference is now made to the following description, taken in conjunction with the accompanying drawings, in which:

FIG. 1 illustrates one embodiment of a system that includes a congestion monitor and processors;

FIG. 2 illustrates embodiments of a switch and a programmable device that may be used with the system of FIG. 1; and

FIG. 3 illustrates one embodiment of a method for monitoring congestion at a plurality of processors that may be used by the system of FIG. 1.

DETAILED DESCRIPTION OF THE DRAWINGS

Embodiments of the present invention and its advantages are best understood by referring to FIGS. 1 through 3 of the drawings, like numerals being used for like and corresponding parts of the various drawings.

FIG. 1 illustrates one embodiment of a system 10 that includes a congestion monitor 40 and processors 20. According to the embodiment, congestion monitor 40 monitors congestion at processors 20, and may provide a notification if a processor 20 becomes congested. In the embodiment, congestion monitor 40 may monitor congestion by establishing a time-averaged occupancy for queues 36 buffering packets for the processors 20. If the time-averaged occupancy exceeds a time-averaged occupancy threshold, congestion monitor 40 may provide the notification.

In one embodiment, the time-averaged occupancy may take into account whether the number of packets in a queue exceeds a predetermined threshold. Congestion monitor 40 determines the frequency of excess packets events at a queue 36, where an excess packets event occurs when the number of packets exceeds a number of packets threshold. Congestion monitor 40 then determines whether the frequency of excess packets events exceeds a frequency threshold.

According to one embodiment, system 10 communicates packets. A packet comprises a bundle of data organized in a specific way for transmission. Packets may be used to communicate information. Information may refer to voice, data, text, audio, video, multimedia, control, signaling, other information, or any combination of any of the preceding.

According to the illustrated embodiment, system 10 includes processors 20, a switch 24, and a programmable device 28 coupled as shown. Switch 24 and programmable device 28 include ports 32 and queues 36 coupled as shown. Programmable device 28 includes congestion monitor 40 coupled as shown.

In one embodiment, processors 20 perform arithmetic, logic, control, and/or other suitable processing operations. A processor 20 may represent a microprocessor, a central processing unit (CPU), or any other device operable to perform processing operations. In the illustrated embodiment, processors 20 include processors X and Y_(i), where i=1, 2, 3. In one embodiment, processor X may adjust the flow of packets to a processor Y_(i) in response to a notification that processor Y_(i) is congested.

Switch 24 facilitates communication of packets between processor X and processors Y_(i) by transmitting packets through programmable device 28. According to one embodiment, switch 24 may be located at a baseboard. Switch 24 may have one or more ports 32, and programmable device 28 may have one or more queues 36. A port 32 represents a physical interface for a device through which packets may enter or exit the device. In the illustrated embodiment, switch 24 has ports X₁ and X₂. Port X₁ communicates with processor X, and port X₂ communicates programmable device 28.

Queues 36 store packets until the packets may be transmitted elsewhere. Queues 36 may represent temporary memory such as buffers. In the illustrated embodiment, switch 24 has queue X that stores packets from processor X until programmable device 28 is available to accept the packets.

Programmable device 28 facilitates communication of packets between processor X and processors Y_(i) by transmitting packets between switch 24 and processors Y_(i). According to one embodiment, programmable device 28 may represent a programmable semiconductor device such as a field programmable gate array (FPGA) located at a daughter card of a baseboard card of switch 24.

In the illustrated embodiment, programmable device 28 includes port Y and queues Y_(i), where i=1, 2, 3. Port Y receives packets from switch 24. The packets may then be sorted according to their destination processor Y_(i). Queue Y_(i) receives and buffers packets for processor Y_(i).

Congestion monitor 40 monitors the packets at queues Y_(i). If a queue Y_(i) is congested, congestion monitor 40 notifies processor X that processor Y_(i) is congested. In one embodiment, congestion monitor 40 may monitor congestion by establishing a time-averaged occupancy for queues Y_(i). If the time-averaged occupancy exceeds a time-averaged occupancy threshold, congestion monitor 40 may provide the notification. An example of congestion monitor 40 is described in more detail below with reference to FIG. 2.

In one embodiment, the time-averaged occupancy may refer to the average of the number of times the number of packets at a queue exceeds a predetermined threshold for the queue. In the embodiment, congestion monitor 40 may monitor a queue for an excess packets event that occurs when the number of packets exceeds a number of packets threshold. Congestion monitor 40 may count the number of times an excess packets event occurs. If the frequency of these excess packets events exceeds a frequency threshold, congestion monitor 40 may notify processor X.

A component of system 10 may include any suitable arrangement of elements, for example, an interface, logic, memory, other suitable element, or a combination of any of the preceding. An interface receives input, sends output, processes the input and/or output, performs other suitable operation, or performs a combination of any of the preceding. An interface may comprise hardware and/or software.

Logic performs the operations of the component, for example, executes instructions to generate output from input. Logic may include hardware, software, other logic, or a combination of any of the preceding. Certain logic, such as a processor, may manage the operation of a component. Examples of a processor include one or more computers, one or more microprocessors, one or more applications, other logic, or a combination of any of the preceding.

A memory stores information. A memory may comprise computer memory (for example, Random Access Memory (RAM) or Read Only Memory (ROM)), mass storage media (for example, a hard disk), removable storage media (for example, a Compact Disk (CD) or a Digital Video Disk (DVD)), database and/or network storage (for example, a server), other computer-readable medium, or a combination of any of the preceding.

Modifications, additions, or omissions may be made to system 10 without departing from the scope of the invention. The components of system 10 may be integrated or separated. Moreover, the operations of system 10 may be performed by more, fewer, or other components. For example, the operations of congestion monitor 40 may be performed by more than one component. Additionally, operations of system 10 may be performed using any suitable logic. As used in this document, “each” refers to each member of a set or each member of a subset of a set.

FIG. 2 illustrates embodiments of switch 24 and programmable device 28 that may be used by system 10 of FIG. 1. In the illustrated embodiment, a baseboard 42 includes processor X and switch 24 with queue X. A daughter card 46 includes a receiver 50, a transmitter 54, programmable device 28, and processors Y_(i), where i=1, 2, 3. Programmable device 28 includes queues Y_(i), where i=1, 2, 3, and congestion monitor 40.

In one embodiment, receiver 50 receives packets from switch card 24. Receiver 50 may sort packets to send packets for processor Y_(i) to queue Y_(i). The packets may be sorted using the destination address in the header of the packets. Transmitter 54 transmits packets to switch 24.

Congestion monitor 40 monitors the packets at queues Y_(i). If a queue Y_(i) is congested, congestion monitor 40 notifies processor X that processor Y_(i) is congested. In one embodiment, congestion monitor 40 may monitor congestion by establishing a time-averaged occupancy for queues Y_(i). If the time-averaged occupancy exceeds a time-averaged occupancy threshold, congestion monitor 40 may provide the notification. In one embodiment, congestion monitor 40 may monitor a queue for an excess packets event that occurs when the number of packets exceeds a number of packets threshold. If the frequency of excess packets events exceeds a frequency threshold, congestion monitor 40 may notify processor X.

Congestion monitor 40 may include any suitable components to monitor the packets at queues Y_(i). According to the illustrated embodiment, congestion monitor 40 may have monitors 41 a-c for each queue Y_(i). A monitor 41 may include a counting timer 60, counting logic 62, a counter 64, a reset timer 68, and a comparator 74.

Counting timer 60 provides counter timer indicators that indicate when packets at a queue Y_(i) should be measured to establish whether an excess packets event has occurred. Counting timer 60 may be set to any suitable time interval at which counter timer indicators are provided. The time interval may be sufficiently short to collect enough measurements to provide an accurate estimate of the frequency of excess packets events, but may be sufficiently long to avoid wasting resources by collecting too many measurements. Examples of settings for counting timer 60 include less than 50, 40, or 25 microseconds, such as approximately 10 microseconds.

Counting logic 62 measures the packets at a queue Y_(i) to establish whether an excess packets event has occurred. That is, counting logic 62 determines whether the number of packets in queue Y_(i) has exceeded a number of packets threshold 58. Counting logic 62 may measure to packets in response to receiving a counter timer indicator from counting timer 60.

Number of packets threshold 58 may have any suitable value. The value may be less than the number of packets that queue Y_(i) can buffer, and may be selected according to the processing speed of processor Y_(i). The value may be sufficiently small to provide adequate notification of congestion at processor Y_(i), but may be sufficiently large to allow queue Y_(i) to buffer more packets. Number of packets threshold 58 may be the same or different for the different queues Y_(i).

If counting logic 62 determines that an excess packets event has occurred, counting logic 62 updates counter 64. Counter 64 tracks the number of excess packets events that have occurred during a predetermined period, which may be set by reset timer 68.

Reset timer 68 provides reset indicators to counter 64, which sends the number of excess packets events to comparator 72 in response to receiving a reset indicator. Counter 64 may also reset to an initial value, for example, zero, in response to receiving a reset indicator.

Reset timer 68 may provide reset indicators at any suitable time interval. A shorter time interval may provide more accurate and more updated notifications to processor X, but a longer time interval may tend to use less resources. The time interval for reset timer 68 may have any suitable value, for example, a value equal to or larger than the time interval value for counting timer 60. Examples of reset timer time intervals include great than 50, 100, or 200 microseconds, for example, approximately 1 millisecond.

Comparator 72 compares the frequency of excess packets events with a frequency threshold 70, and notifies processor X if the frequency exceeds the threshold. Frequency threshold 70 indicates the level at which processor X should be notified. Frequency threshold 70 may have any suitable value, and may have a lower value for more frequent notification or a higher value for less frequent notification.

The notification may be provided in any suitable manner. For example, the notification may be provided via an interrupt that is routed from programmable device 28 to processor X. As another example, the notification may be provided by sending a pre-programmed notification packet. The notification may identify which processor Y_(i) is congested.

In one embodiment, processor X may be operable to prioritize the flow of packets, for example, slow the flow of packets to the identified processor Y_(i) and/or buffer packets. The prioritization may reduce or prevent congestion and/or head-of-line blocking problems.

Values of number of packets threshold 58, frequency threshold 70, counting timer interval, and/or reset timer interval may be selected to manage congestion at processors Y_(i). For example, congestion prevention may prevent congestion before congestion occurs, and congestion control may reduce the effect of congestion after it occurs.

Modifications, additions, or omissions may be made to congestion monitor 40 without departing from the scope of the invention. The components of congestion monitor 40 may be integrated or separated. Moreover, the operations of congestion monitor 40 may be performed by more, fewer, or other components. For example, the operations of counting timer 60 and reset timer 68 may be performed by one component, or the operations of counting logic 62 may be performed by more than one component. Additionally, operations of congestion monitor 40 may be performed using any suitable logic.

FIG. 3 illustrates one embodiment of a method for monitoring congestion at a plurality of processors that may be used by system 10 of FIG. 1. The method begins at step 110, where processor X sends packets to switch 24. Switch 24 sends the packets to programmable device 28 at step 112.

Queues Y_(i) receive the packets at step 114. Counting timer 60 may provide a counting timer indicator at step 118. If no counting timer indicator is provided, the method returns to step 114, where queues Y_(i) continue to receive packets. If a counting timer indicator is provided, the method proceeds to step 122.

Counting logic 62 measures the number of packet in queues Y_(i) at step 122. The number of packets in the queues Y_(i) may exceed number of packets threshold 58 at step 126, indicating that an excess packets event has occurred. If threshold 58 has been exceeded, the method proceeds to step 130, where counter 64 is incremented. The method then proceeds to step 134. If threshold 58 has not been exceeded, the method proceeds directly to step 134.

Reset timer 68 may provide a reset timer indicator at step 134. If no reset timer indicator is provided, the method returns to step 114, where queues Y_(i) continue to receive packets. If a reset timer indicator is provided, the method proceeds to step 138.

Comparators 72 compare the frequency of excess packet occurrences with a frequency threshold 70 at step 138. The frequency may exceed threshold 70 at step 142. If threshold 70 has been exceeded, congestion monitor 40 notifies processor X at step 146. Congestion monitor 40 may notify processor X that a particular processor Y_(i) is congested. The method then proceeds to step 150. If threshold 70 has not been exceeded, the method proceeds directly to step 150.

Counter 64 is reset at step 150. Programmable device 28 may receive more packets are step 154. If there are more packets, the method returns to step 114, where queues Y_(i) receive the packets. If there are no more packets, the method terminates.

Modifications, additions, or omissions may be made to the method without departing from the scope of the invention. The method may include more, fewer, or other steps. Additionally, steps may be performed in any suitable order.

Certain embodiments of the invention may provide one or more technical advantages. A technical advantage of one embodiment may be that a congestion monitor may monitor congestion at a number of processors, and may provide a notification if a processor becomes congested. In the embodiment, the congestion monitor may monitor congestion by establishing a time-averaged occupancy for queues buffering packets for the processors. If the time-averaged occupancy exceeds a time-averaged occupancy threshold, the congestion monitor may provide the notification.

Another technical advantage of one embodiment may be that the time-averaged occupancy may take into account whether the number of packets in a queue exceeds a predetermined threshold. In the embodiment, the congestion monitor determines the frequency of excess packets events at a queue, where an excess packets event occurs when the number of packets exceeds a number of packets threshold. The congestion monitor then determines whether the frequency of excess packets events exceeds a frequency threshold.

Another technical advantage of one embodiment may be that the congestion monitor may be provided at a programmable device separate from the switch. Accordingly, the switch need not be redesigned and/or upgraded. Another technical advantage of one embodiment may be that the different threshold values may be adjusted to achieve a desired amount of congestion control and/or congestion prevention.

Although this disclosure has been described in terms of certain embodiments, alterations and permutations of the embodiments will be apparent to those skilled in the art. Accordingly, the above description of the embodiments does not constrain this disclosure. Other changes, substitutions, and alterations are possible without departing from the spirit and scope of this disclosure, as defined by the following claims. 

1. A method for monitoring congestion at a plurality of processors, comprising: receiving a plurality of packets at a plurality of queues, each queue associated with a processor of a plurality of processors; and performing the following for each queue of the plurality of queues: establishing whether a time-averaged occupancy of the each queue exceeds a time-averaged occupancy threshold; and providing a notification if the time-averaged occupancy exceeds the time-averaged occupancy threshold.
 2. The method of claim 1, wherein establishing whether the time-averaged occupancy of the each queue exceeds the time-averaged occupancy threshold further comprises: establishing whether a frequency of excess packets events exceeds a frequency threshold, an excess packets event occurring when a number of packets at the each queue exceeds a number of packets threshold.
 3. The method of claim 1, wherein establishing whether the time-averaged occupancy of the each queue exceeds the time-averaged occupancy threshold further comprises: performing the following for a specified number of iterations: establishing whether there is an excess packets event, an excess packets event occurring when a number of packets at the each queue exceeds a number of packets threshold; and counting the number of excess packets events to establish a frequency of excess packets events.
 4. The method of claim 1, wherein establishing whether the time-averaged occupancy of the each queue exceeds the time-averaged occupancy threshold further comprises: performing the following for each counter timer indicator of a plurality of counter timer indicators: establishing whether there is an excess packets event in response to the each counter timer indicator, an excess packets event occurring when a number of packets at the each queue exceeds a number of packets threshold; and incrementing an excess packets event counter if there is an excess packets event.
 5. The method of claim 1, wherein receiving the plurality of packets at the plurality of queues further comprises: receiving the plurality of packets from a sender processor through a switch.
 6. The method of claim 1, wherein providing the notification if the time-averaged occupancy exceeds the time-averaged occupancy threshold further comprises: providing the notification to a sender processor, the notification identifying the processor associated with the each queue, the sender processor operable to adjust a flow of packets to the identified processor in response to the notification.
 7. A system for monitoring congestion at a plurality of processors, comprising: a plurality of queues operable to receive a plurality of packets, each queue associated with a processor of a plurality of processors; and a congestion monitor coupled to the plurality of queues and operable to perform the following for each queue of the plurality of queues: establish whether a time-averaged occupancy of the each queue exceeds a time-averaged occupancy threshold; and provide a notification if the time-averaged occupancy exceeds the time-averaged occupancy threshold.
 8. The system of claim 7, the congestion monitor further operable to establish whether the time-averaged occupancy of the each queue exceeds the time-averaged occupancy threshold by: establishing whether a frequency of excess packets events exceeds a frequency threshold, an excess packets event occurring when a number of packets at the each queue exceeds a number of packets threshold.
 9. The system of claim 7, the congestion monitor further operable to establish whether the time-averaged occupancy of the each queue exceeds the time-averaged occupancy threshold by: performing the following for a specified number of iterations: establishing whether there is an excess packets event, an excess packets event occurring when a number of packets at the each queue exceeds a number of packets threshold; and counting the number of excess packets events to establish a frequency of excess packets events.
 10. The system of claim 7, the congestion monitor further operable to establish whether the time-averaged occupancy of the each queue exceeds the time-averaged occupancy threshold by: performing the following for each counter timer indicator of a plurality of counter timer indicators: establishing whether there is an excess packets event in response to the each counter timer indicator, an excess packets event occurring when a number of packets at the each queue exceeds a number of packets threshold; and incrementing an excess packets event counter if there is an excess packets event.
 11. The system of claim 7, the plurality of queues further operable to receive the plurality of packets at the plurality of queues by: receiving the plurality of packets from a sender processor through a switch.
 12. The system of claim 7, the congestion monitor further operable to provide the notification if the time-averaged occupancy exceeds the time-averaged occupancy threshold by: providing the notification to a sender processor, the notification identifying the processor associated with the each queue, the sender processor operable to adjust a flow of packets to the identified processor in response to the notification.
 13. Logic for monitoring congestion at a plurality of processors, the logic embodied in computer-readable storage media and operable to: receive a plurality of packets at a plurality of queues, each queue associated with a processor of a plurality of processors; and perform the following for each queue of the plurality of queues: establish whether a time-averaged occupancy of the each queue exceeds a time-averaged occupancy threshold; and provide a notification if the time-averaged occupancy exceeds the time-averaged occupancy threshold.
 14. The logic of claim 13, further operable to establish whether the time-averaged occupancy of the each queue exceeds the time-averaged occupancy threshold by: establishing whether a frequency of excess packets events exceeds a frequency threshold, an excess packets event occurring when a number of packets at the each queue exceeds a number of packets threshold.
 15. The logic of claim 13, further operable to establish whether the time-averaged occupancy of the each queue exceeds the time-averaged occupancy threshold by: performing the following for a specified number of iterations: establishing whether there is an excess packets event, an excess packets event occurring when a number of packets at the each queue exceeds a number of packets threshold; and counting the number of excess packets events to establish a frequency of excess packets events.
 16. The logic of claim 13, further operable to establish whether the time-averaged occupancy of the each queue exceeds the time-averaged occupancy threshold by: performing the following for each counter timer indicator of a plurality of counter timer indicators: establishing whether there is an excess packets event in response to the each counter timer indicator, an excess packets event occurring when a number of packets at the each queue exceeds a number of packets threshold; and incrementing an excess packets event counter if there is an excess packets event.
 17. The logic of claim 13, further operable to receive the plurality of packets at the plurality of queues by: receiving the plurality of packets from a sender processor through a switch.
 18. The logic of claim 13, further operable to provide the notification if the time-averaged occupancy exceeds the time-averaged occupancy threshold by: providing the notification to a sender processor, the notification identifying the processor associated with the each queue, the sender processor operable to adjust a flow of packets to the identified processor in response to the notification.
 19. A system for monitoring congestion at a plurality of processors, comprising: means for receiving a plurality of packets at a plurality of queues, each queue associated with a processor of a plurality of processors; and means for performing the following for each queue of the plurality of queues: establishing whether a time-averaged occupancy of the each queue exceeds a time-averaged occupancy threshold; and providing a notification if the time-averaged occupancy exceeds the time-averaged occupancy threshold.
 20. A system for monitoring congestion at a plurality of processors, comprising: a plurality of queues operable to receive a plurality of packets, each queue associated with a processor of a plurality of processors, the plurality of packets received from a sender processor through a switch; and a congestion monitor coupled to the plurality of queues and operable to perform the following for each queue of the plurality of queues: establish whether a time-averaged occupancy of the each queue exceeds a time-averaged occupancy threshold by: establishing whether a frequency of excess packets events exceeds a frequency threshold, an excess packets event occurring when a number of packets at the each queue exceeds a number of packets threshold; performing the following for each counter timer indicator of a plurality of counter timer indicators: establishing whether there is an excess packets event in response to the each counter timer indicator, an excess packets event occurring when a number of packets at the each queue exceeds a number of packets threshold; and incrementing an excess packets event counter if there is an excess packets event; and provide a notification if the time-averaged occupancy exceeds the time-averaged occupancy threshold by: providing the notification to the sender processor, the notification identifying the processor associated with the each queue, the sender processor operable to adjust a flow of packets to the identified processor in response to the notification. 